Clustering of Correlated Documents into Designated Number of Clusters: A Practical Approach

نویسنده

  • Ali Cevahir
چکیده

We consider a complete graph to cluster the vertices into k-clusters, where each edge of the graph is labeled either as “+” or “–”. “+” denotes that vertices incident to the edge are mutually related and “–” edge denotes that the vertices incident to the edge are mutually unrelated. The goal of the clustering is to place vertices into k clusters, where documents are clustered with maximally related items. That is, clustering should maximize the agreements (“+” edges inside the clusters and “–” edges between the clusters), or equivalently minimizes the disagreements (“–” edges inside the clusters and “+” edges between the clusters). We give a simple algorithm for the maximizing the agreements and test the success of the algorithm. We compare approach with Bansal et. al’s approach proposed in [1]., and conclude that complicated algorithms that have exponential run time are not practical.

برای دانلود رایگان متن کامل این مقاله و بیش از 32 میلیون مقاله دیگر ابتدا ثبت نام کنید

ثبت نام

اگر عضو سایت هستید لطفا وارد حساب کاربری خود شوید

منابع مشابه

Data Clustring Using A New CGA(Chaotic-Generic Algorithm) Approach

Clustering is the process of dividing a set of input data into a number of subgroups. The members of each subgroup are similar to each other but different from members of other subgroups. The genetic algorithm has enjoyed many applications in clustering data. One of these applications is the clustering of images. The problem with the earlier methods used in clustering images was in selecting in...

متن کامل

Data Clustring Using A New CGA(Chaotic-Generic Algorithm) Approach

Clustering is the process of dividing a set of input data into a number of subgroups. The members of each subgroup are similar to each other but different from members of other subgroups. The genetic algorithm has enjoyed many applications in clustering data. One of these applications is the clustering of images. The problem with the earlier methods used in clustering images was in selecting in...

متن کامل

Comparison of Strategic Plans of Universities and Institutes of Higher Education with a Quantitative Approach

Strategic planning in Iranian universities and institutes of higher education is generally prepared using strategic planning models introduced by experts and other universities. These programs will be published in the form of university strategic planning documents. These documents have such features that can be similar or different than the programming templates used. Existence of the similar...

متن کامل

New Approach for Customer Clustering by Integrating the LRFM Model and Fuzzy Inference System

This study aimed at providing a systematic method to analyze the characteristics of customers’ purchasing behavior in order to improve the performance of customer relationship management system. For this purpose, the improved model of LRFM (including Length, Recency, Frequency, and Monetary indices) was utilized which is now a more common model than the basic RFM model apt for analyzing the cus...

متن کامل

A clustering approach for mineral potential mapping: A deposit-scale porphyry copper exploration targeting

This work describes a knowledge-guided clustering approach for mineral potential mapping (MPM), by which the optimum number of clusters is derived form a knowledge-driven methodology through a concentration-area (C-A) multifractal analysis. To implement the proposed approach, a case study at the North Narbaghi region in the Saveh, Markazi province of Iran, was investigated to discover porphyry ...

متن کامل

ذخیره در منابع من


  با ذخیره ی این منبع در منابع من، دسترسی به آن را برای استفاده های بعدی آسان تر کنید

عنوان ژورنال:

دوره   شماره 

صفحات  -

تاریخ انتشار 2005